XPath Query Satisfiability and Containment under DTD Constraints
Author
Abstract
In this thesis, we consider the XML query language XPath, along with XML documents whose integrity constraints are given as document type definitions (DTDs). In particular, we study the problems of XPath satisfiability and XPath containment in the presence of DTDs. The motivation for studying XPath is that it is the main language for navigating and extracting information from XML documents. The motivation for studying DTDs, as opposed to a newer XML schema formalism, is that DTDs are well known, stable, and well tested. Furthermore, they are still widely used in various domains. The major contributions of the thesis fall into two groups: those on satisfiability and those on containment of XPath queries under DTDs. With respect to satisfiability, we show that the problem for the fragment XP{/,[ ]} is NP-hard in general. To study whether this worst-case behaviour arises often in practice, we investigate real-world DTDs and discover that the majority of them satisfy a property we call the covering property. We then show that XPath satisfiability for the fragment XP{/,[ ],∗,//,∪} is in PTIME under covering DTDs. We also show that it is decidable in PTIME for duplicate-free DTDs (a property introduced in [91]), which also occur often in practice. Despite these positive results for satisfiability, we prove that XPath containment under covering DTDs for XP{/,[ ]} is still coNP-hard. However, we define a class of DTDs, called well-behaved DTDs, under which containment for XP{/,[ ]} is tractable.
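To make the notion of satisfiability under a DTD concrete, here is a minimal Python sketch for XP{/,[ ]} queries over a deliberately simplified toy DTD model. It is not the thesis's algorithm and does not use the covering or duplicate-free properties, which the abstract does not define. The assumptions are all hypothetical: each element's content model is a choice between groups of child names, every child in the chosen group is optional and repeatable, and the names `ToyDTD`, `Pattern`, and `satisfiable` are invented for illustration. Under these assumptions the check is easy; the NP-hardness stated in the abstract concerns general DTD content models, which this toy model does not capture.

```python
from typing import Dict, FrozenSet, List, Tuple

# A tree pattern for XP{/,[ ]}: (element label, list of sub-patterns).
# Child steps and predicates are both modelled as sub-patterns.
Pattern = Tuple[str, list]

# Toy DTD (an assumption of this sketch, not a real DTD model): each element
# maps to alternative groups of child names. One group is chosen per node;
# every child in the chosen group is optional and may repeat.
ToyDTD = Dict[str, List[FrozenSet[str]]]


def satisfiable(pattern: Pattern, dtd: ToyDTD) -> bool:
    """Return True if some tree conforming to the toy DTD matches the pattern."""
    label, subs = pattern
    if label not in dtd:
        return False  # element is not declared in the DTD
    if not subs:
        return True  # children are optional, so a bare element always exists
    # Some single alternative must allow all required children at once.
    for group in dtd[label]:
        if all(sub[0] in group and satisfiable(sub, dtd) for sub in subs):
            return True
    return False


# Toy DTD: a -> (b*) | (c*); b and c are leaves.
toy_dtd: ToyDTD = {
    "a": [frozenset({"b"}), frozenset({"c"})],
    "b": [frozenset()],
    "c": [frozenset()],
}

query_ok = ("a", [("b", [])])              # a[b]
query_bad = ("a", [("b", []), ("c", [])])  # a[b][c]

print(satisfiable(query_ok, toy_dtd))
print(satisfiable(query_bad, toy_dtd))
```

In this toy DTD the query a[b] is satisfiable, while a[b][c] is not, because no single alternative of `a` allows both a `b` child and a `c` child. Interactions of this kind between predicates and choices in content models are what make the problem harder once content models are richer than in this sketch.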
Similar resources
Containment for XPath Fragments under DTD Constraints
The containment and equivalence problems for various fragments of XPath have been studied by a number of authors. For some fragments, deciding containment (and even minimisation) has been shown to be in PTIME, while for minor extensions containment has been shown to be coNP-complete. When containment is with respect to trees satisfying a set of constraints (such as a schema or DTD), the proble...
Speeding up XML querying: satisfiability test & containment test of XPath queries in the presence of XML schema definitions
This dissertation develops approaches to testing the satisfiability and the containment of XPath queries in the presence of XML Schema definitions in order to speed up XML querying. XML provides a simple yet powerful mechanism for information storage, processing and delivery, and is a widely used standard data format. XPath is a basic language for querying XML data, and is embedded into many W3...
Speeding up Xml Querying
This dissertation develops approaches to testing the satisfiability and the containment of XPath queries in the presence of XML Schema definitions in order to speed up XML querying. XML provides a simple yet powerful mechanism for information storage, processing and delivery, and is a widely used standard data format. XPath is a basic language for querying XML data, and is embedded into many W3...
A Decision Procedure for XPath Satisfiability in the Presence of DTD Containing Choice
XPath satisfiability is one of the most basic problems of XML query optimization. A satisfiability decision framework, named SAT-DTD, is proposed to determine, given a set of XPath queries P and a DTD τ, which subset of P are satisfiable by an XML tree conforming to DTD τ. In the framework, an indexed NFA is constructed from the set of XPath queries P, and then the NFA is driven by simple API f...
Regular XPath: Constraints, Query Containment and View-Based Answering for XML Documents
In this paper we consider a powerful mechanism, called Regular XPath, for expressing queries and constraints over XML data, including DTDs and existential path constraints and their negation. Regular XPath extends XPath with binary relations over XML nodes specified by means of two-way regular path queries. Our first contribution deals with checking satisfiability of Regular XPath constraints. Whi...
Publication date: 2014